Inquery System Overview 1. Description of Final System 1.1. Approach 1.2. Processing Flow Country Recognizer: for Each Mention of a Country Company Name Recognizer: for Each Citation of A
نویسندگان
چکیده
(which includes MCC as a subcontractor), has focused on the following goals: Improving the eeectiveness of information retrieval techniques for large, full-text databases, Improving the eeectiveness of routing techniques appropriate for long-term information needs, and Demonstrating the eeectiveness of these retrieval and routing techniques for Japanese full text databases 5]. Our general approach to achieving these goals has been to use improved representations of text and information needs in the framework of a new model of retrieval. This model uses Bayesian networks to describe how text and queries should be used to identify relevant documents 7, 4, 8]. Retrieval (and routing) is viewed as a proba-bilistic inference process which compares text representations based on diierent forms of linguistic and statistical evidence to representations of information needs based on similar evidence from natural language queries and user interaction. Learning techniques are used to modify the initial queries both for short-term and long-term information needs (relevance feedback and routing, respectively). This approach (generally known as the inference net model and implemented in the INQUERY system 2]) emphasizes retrieval based on combination of evidence. Diierent text representations (such as words, phrases, paragraphs, or manually assigned keywords) and diier-ent versions of the query (such as natural language and Boolean) can be combined in a consistent probabilistic framework. This type of \data fusion" has been known to be eeective in the information retrieval context for a number of years, and was one of the primary motivations for developing the inference net approach. Another feature of the inference net approach is the ability to capture complex structure in the network representing the information need (i.e. the query). A practical consequence of this is that complex Boolean queries can be evaluated as easily as natural language queries and produce ranked output. It is also possible to represent \rule-based" or \concept-based" queries in the same probabilistic framework. This has led to us concentrating on automatic analysis of queries and techniques for enhancing queries rather than on in-depth analysis of the documents in the database. In general, it is more eeec-tive (as well as eecient) to analyze short query texts than millions of document texts. The results of the query analysis are represented in the INQUERY query language which contains a number of operators, such as #SUM, #AND, #OR, #NOT, #PHRASE, and #SYN. These operators implement diierent methods of combining evidence. Some of the speciic research issues we are …
منابع مشابه
Distribution Systems Reconfiguration Using Pattern Recognizer Neural Networks
A novel intelligent neural optimizer with two objective functions is designed for electrical distribution systems. The presented method is faster than alternative optimization methods and is comparable with the most powerful and precise ones. This optimizer is much smaller than similar neural systems. In this work, two intelligent estimators are designed, a load flow program is coded, and a spe...
متن کاملمدل میکروسکوپی دوگوشی مبتنی بر فیلتر بانک مدولاسیون برای پیش گویی قابلیت فهم گفتار در افراد دارای شنوایی عادی
In this study, a binaural microscopic model for the prediction of speech intelligibility based on the modulation filter bank is introduced. So far, the spectral criteria such as the STI and SII or other analytical methods have been used in the binaural models to determine the binaural intelligibility. In the proposed model, unlike all models of binaural intelligibility prediction, an automatic ...
متن کاملبهبود کارایی سیستم کاوشگر کلمات تلفنی با استفاده از نرمالیزاسیون امتیاز اطمینان مبتنی بر روش برنامهریزی خطی
Conventional word spotting systems determine hypothesized keywords and their confidence score using a speech recognizer. Acceptance or rejection of these keywords is intended based on comparison of their scores with a specific threshold. It has been proved that confidence score prepared by recognizer is highly dependent on sub-word structure of each keyword. So comparing assigned scores to keyw...
متن کاملA New Statistical Approach for Recognizing and Classifying Patterns of Control Charts (RESEARCH NOTE)
Control chart pattern (CCP) recognition techniques are widely used to identify the potential process problems in modern industries. Recently, artificial neural network (ANN) –based techniques are very popular to recognize CCPs. However, finding the suitable architecture of an ANN-based CCP recognizer and its training process are time consuming and tedious. In addition, because of the black box ...
متن کاملCorefrence resolution with deep learning in the Persian Labnguage
Coreference resolution is an advanced issue in natural language processing. Nowadays, due to the extension of social networks, TV channels, news agencies, the Internet, etc. in human life, reading all the contents, analyzing them, and finding a relation between them require time and cost. In the present era, text analysis is performed using various natural language processing techniques, one ...
متن کامل